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Without assuming necessary conditions for observers such as galaxies or entropy production, we 
show that the causal patch measure predicts the coincidence of vacuum energy and present matter 
density. Their common scale, and thus the enormous size of the visible universe, has its origin in 
the number of metastable vacua in the landscape. 



Introduction The smallness of the cosmological con- 
stant, A, has long posed a hierarchy problem: why is 
the energy of the vacuum at least sixty orders of mag- 
nitude smaller than the natural value expected from the 
Standard Model [THS]? 

The actual magnitude of the cosmological constant was 
measured more recently @Hi]; it poses, in addition, a 
coincidence problem. The vacuum energy, 

PA = ~ * 1.4 x 1(T 123 , (1) 

is comparable in magnitude to the present matter density. 
(We use Planck units, with c = H = G = 1.) This consti- 
tutes a coincidence of two a priori unrelated timescales: 

tA ~ ^obs , (2) 

— 1/2 

where t\ ~ p A is the time at which vacuum energy be- 
gins to dominate, and i Q b s is the time at which observers 
exist. 

In a theory that contains a sufficiently dense discre- 
tuum of values of A, anthropic arguments can explain 
the smallness of A. Weinberg [7j argued that observers 
require galaxies, which form only if t\ > t ga i, where t ga j is 
the time at which density perturbations grow nonlinear. 
This approach successfully predicted a nonvanishing cos- 
mological constant, and it explains the coincidence that 
t\ ~ igaj. It has the following limitations: 

• It does not explain the coincidence i b s ~ t&. 
(There is no reason a priori why i b s should not 
vastly exceed £ ga i-) 

• It applies only to observers whose existence de- 
pends on the formation of galaxies. 

• It actually favors values of A a few orders of mag- 
nitude larger than the observed value. 

• It explains one unnaturally large timescale, t\, in 
terms of another, i ga j. This scale, in turn, arises 
from a combination of unnaturally small quantities 



whose fundamental origin is not clear (the strength 
of primordial density perturbations and the tem- 
perature at matter-radiation equality). The corre- 
lation of t\ with i ga i is a highly nontrivial predic- 
tion, but it does not address the fundamental origin 
of their enormous scale. 

• The prediction was based on a cosmological mea- 
sure, observers per baryon, that has been ruled out 
phenomenologically [8]. 

The causal patch measure transcends the above limita- 
tions. It explains the coincidence ^ from geometric 
properties, assuming only that observers are made of 
matter or radiation; and it relates the absolute scale, 
to the size of the landscape. 

Results Without assuming specific conditions neces- 
sary for observers, we will derive three results. The first, 
t\ ~ tabs, solves the coincidence problem and predicts 
that all observers are most likely to find themselves at 
the onset of vacuum domination, independently of their 
nature and of t b s - Our second result, t c ~ t b s , predicts 
another coincidence: that the timescale associated with 
spatial curvature is comparable to i b s - Our third result, 
^obs ~ A/" 1 / 2 , explains the origin of the enormous phe- 
nomenological scales governing the visible universe, such 
as its size and age, in terms of the number of landscape 
vacua that can be cosmologically produced and which 
contain observers, Af. It can be regarded as a predic- 
tion for At. We stress that Af will be smaller than Af, 
the total number of vacua, due to anthropic and cosmo- 
logical selection effects, neither of which we will consider 
in this paper. Intriguingly, preliminary analyses 110] 
suggest k^A/" 1 / 2 ) ~ 0(100) for a class of vacua [TT]. If 
this class dominates the landscape, then our prediction 
implies that log Af is not very much smaller than log Af, 
and that the fundamental origin of hierarchy lies in the 
number of topological cycles of complex three-manifolds. 

Relation to other work Polchinski [2] first articulated 
the question of the ultimate origin of the scale of the 
cosmological constant in the context of the landscape. 
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Refs. [31[T2] anticipated the answer reached in the present 
paper but not its derivation. Less general derivations 
have been proposed for small subsets of the landscape 
(vacua that differ from ours only in a few parame- 
ters) [I3J or under the assumption that observers 
arise in proportion to the entropy produced [HI [T5] . A 
direct antecedent |16j of our arguments employed the 
causal diamond, a measure that is somewhat less well- 
defined than the causal patch, and which suppresses the 
subtle role played by curvature in the present analysis. 

Because the three timescales are well-defined across 
the entire landscape and no specific anthropic assump- 
tions are made, our results apply to arbitrary observers 
in arbitrary vacua. Other parameters like the masses of 
leptons [T7] or the timescales of structure formation [T3] , 
may be correlated with i bs, but they arc defined only in 
small portions of the landscape and will not be consid- 
ered here. In a separate publication, we will study other 
measures, and we will argue that currently no measure 
is viable in the domain of dependence of a spacelike sin- 
gularity, such as regions with negative vacuum energy. 
Here, we restrict to vacua with A > 0. 

Our arguments differ significantly from those of 
Refs. [IH1 [H], which appeal to the discreteness of the 
string landscape to cut off a probability distribution that 
favors small values of the cosmological constant expo- 
nentially in A -1 , regardless of i b s . These approaches 
appear to conflict with observation because small A is 
selected too strongly. (Vacua in which observers arise 
by dynamical evolution from a low-entropy initial state 
are extremely rare in a realistic landscape, compared to 
vacua in which observers can be formed only by quantum 
fluctuations but with probability greater than e _1 / A . So 
the vacuum with smallest A is likely in the latter class, 
and one predicts observations that are incompatible with 
a long semiclassical history.) 

Derivation The relative probability for two outcomes 
of a cosmological measurement is given by P1/P2 — 
N1/N2, where Ni is the expected number of times each 
outcome occurs in the universe. Thus, the Ni play the 
role of an unnormalized probability distribution. A dis- 
tribution dp/dx over a continuous parameter x can be 
computed as the number dN of outcomes occurring in 
the range (x, x + dx). 

The landscape of string theory contains long-lived 
de Sitter vacua which give rise to eternal inflation [S]. 
Globally, every experiment and every possible outcome 
occurs infinitely many times: Nj —00. To obtain well- 
defined relative probabilities, these divergences must be 
regulated: this is the measure problem of eternal infla- 
tion. Here we consider the causal patch measure [HI HD] , 
which restricts to the causal past of a point on the fu- 
ture boundary of spacetime (see Fig. [I]), in which Ni 
is to be computed. (Because geometric cutoffs such as 
the causal patch disrupt the worldlines of some particles, 
they require a justification in terms of a physical mech- 




FIG. 1. Conformal diagram of a portion of the multiverse. 
The dashed line shows an infinite hyperbolic surface of con- 
stant FRW time tabs- The causal patch (shaded) restricts 
attention to the finite portion that lies within the event hori- 
zon. 



anism 2IJ. This remains an important open problem 
which we do not address here.) 

Consider an arbitrary observer who lives at an FRW 
time of order t a ^ s . What order of magnitude for t A and 
t c is he likely to observe? This is described by the prob- 
ability distribution over logt^ and logi c at fixed logi b s , 
which can be written as 



dp 



dp 



d\ogt A dlogt c d\ogt A d\ogt t 



n ohs (log t c , log t\ ; log t ohs ) . 

(3) 



We will begin by computing this distribution; later we 
will allow t bs to vary as well. 

The first factor in Eq. ^ is the "prior probability" ; it 
corresponds to the expected number of times a vacuum 
with specified values of log^A and logt c is nucleated in 
the causal patch |22j . This is proportional to the number 
of vacua in the landscape with specified values of log t A 
and log t c , multiplied by the rate at which such vacua are 
produced cosmologically from specified initial conditions. 
But this rate will be independent of t A and t c in the 
regime of interest (t A ^ 1). In the string landscape, 
a decay changes A enormously compared to the energy 
scales associated with t A and t c in the daughter vacuum. 
Thus, the decay chains leading to vacua of interest have 
no information about the eventual values of t A and t c . 

The cosmological constant in a metastable vacuum in- 
cludes large nongravitational contributions, so zero is not 
a special value for the sum. Hence, the density of vacua, 
dN /cZA, can be Taylor-expanded around A = 0. Vacua 
with A ~ 1 contain only a few bits of causally connected 
information [23] , and hence no complex systems of any 
kind. Thus, we may restrict attention to vacua with 
A -C 1 and keep only the leading order in the expan- 
sion, dAf/dA = const [7]. With t A ~ A~ 1/2 , this implies 



dp 



d log t A d log t c 



(4) 



Here g encodes the prior probability distribution over the 
time of curvature domination. 



3 



The second factor in Eq. (|3]) is the number of observers 
present within the causal patch, at the time i b s in a 
vacuum with parameters (t A ,t c ). It can be written as 

n ohs = Mcp{logt A ,logt c ;logt ohs )h(logt A ,\ogt c ;\ogt ohs ) , 

(5) 

where Mqp is the total matter mass present within the 
causal patch at the time i bsi and h is the number of 
observers per unit matter mass. In principle, h is very 
difficult to compute, but we will argue below that h is 
trivial in the only regime where it could possibly affect 
our result. In vacua with no observers at i bs, we set 
h = 0. 

We now turn to the computation of Afcp- Vacua are 
cosmologically produced as open FRW universes [H] with 
metric ds 2 = ~dt 2 + a(t) 2 (dx 2 + sinh^df^), embedded 
in an eternally inflating parent vacuum. The boundary 
of the causal patch is given by the past light-cone from 
the future end point of a comoving geodesic, which may 
be chosen to lie at x — by homogeneity, and which co- 
incides with the event horizon for long-lived metastable 
de Sitter vacua. It can be computed as if the de Sit- 
ter vacuum was eternal, as the correction from late-time 
decay is negligible: Xcp(*) = / t °° dt' /a(t'). 

If t c < t\, the universe contains a curvature dominated 
era. The evolution of the scale factor is governed by the 
Friedmann equation, a 2 /a 2 = i c /a 3 + a~ 2 + t A 2 , where 



t c /a 3 ~ p m is the energy density of pressureless matter. 
The remaining terms encode the curvature and the cos- 
mological constant. (The inclusion of a radiation term 
would not affect our results qualitatively.) A piecewise 
approximate solution is 

f tl'h 2 '* , t < t c 

o(t) ~ < t , t c <t<t A (6) 

{ t^' 1 ^ 1 , t A < t . 

Integration yields the comoving radius of the causal 
patch: 

r i + iog(uAc) + 3 [i - (t/t c y/ 3 ] ,t<t c 

Xcp(t) ~ < 1 + log(i A /*) , t c <t<t A 
{ e-*/ tA , i A < t 

(7) 

In the regime where curvature never dominates, t A <C t c , 
the flat FRW metric is a good approximation: a(t) ~ 
i c 1/3 i 2 / 3 (t < t A ); a[t) ~ ic /3 t A /3 e*/*A-i (f A < t ). Inte- 
gration, or setting i c — !• t A in Eq. Q, yields xcp- 

The mass inside the causal patch at the time t b s is 
M C p = p m a 3 V com = t c V com [xcp(tobs)}- The comoving 
volume inside a sphere in hyperbolic space, V^m, can be 
approximated by x 3 f° r X ^ 1 (i n the regime t A < t b s ), 
and by e 2x for x ^ 1 (i- e -j f° r > tabs)- Substituting 
into Eqs. ([3|-([5]), we find the probability distribution 



d 2 p 



d logi c dlogt A 



gh x < 



^c/^obs ' 
1/*A , 

f~ lp~ 3* bs/tA 



^obs <t c <t A I 

tc < t bs < t A II 

t c <t A < t b s III 

tabs <t A <t c V 

t A < t ohs and t A < t c IV 



(8) 



For the moment, let us ignore the unknown functions 
g and h, i.e., set gh constant. Then the probability dis- 
tribution is a function of powers and exponentials of t c 
and tA- Hence, it depends at least exponentially on the 
variables logt c and logtA, so it will be dominated by its 
maximum. The maximum can be found by following the 
gradient flow, which can be thought of as a "probabil- 
ity force" [131 [T7] pushing log t A and log t c towards their 
preferred values. In Fig. [2] arrows indicate whether logt c 
and logtA prefer to increase or decrease (or neither). The 
arrows do not reflect the precise direction and strength 
of the gradient. For example, in region III, the probabil- 
ity grows with t c , and since the exponential dominates, 
it also grows with t A ; this is indicated by a diagonal ar- 
row pointing towards larger \ogt c and log^A- (Recall 
that we are holding t h s fixed for now.) By following 
the arrows, we recognize that the probability density is 
maximal not at a point, but along two degenerate half- 
lines of stability: log^A = log^obs, logic ^ logiobs! an d 



logi c = logtobs, logtA > logiobs- 

Let us now include the effects of the two prefactors we 
have so far neglected, beginning with h, the number of 
observers per unit mass. Crucially, h cannot depend on 
i a for t a 3> iobs j and it cannot depend on t c for t c 3> t bs • 
This is because in these regimes, curvature or vacuum en- 
ergy have no dynamical effect prior to the time i b s , so 
they cannot be measured by anything, including h. For 
i c *C i bs, we will assume that h is an increasing function 
of t c ; similarly, for iA <C i bs, we will assume that h is 
an increasing function of tA- This assumption seems ex- 
traordinarily weak, in the sense that its converse would 
be preposterous. The effects of early curvature domina- 
tion and vacuum domination arc both gravitational, and 
thus universal. They limit the amount of matter than can 
cohere; the smaller t c or iA, the smaller the largest struc- 
ture in the universe will be. To deny our assumption is to 
claim that on average over many vacua, this disruption 
is correlated with an increased number of observers per 
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FIG. 2. The probability distribution over the timescales of 
curvature and vacuum domination at fixed observer timescale 
tobs, before the prior distribution over logt c and the finiteness 
of the landscape are taken into account. Arrows indicate di- 
rections of increasing probability. The distribution is peaked 
along two degenerate half- lines (thick). 



unit mass. (In fact, an even weaker assumption would 
suffice, namely that as t c or t\ are decreased below t bs ; 
h does not grow so rapidly as to overwhelm the factors 
that suppress the probability in this regime.) 

We thus arrive at a key intermediate result of this pa- 
per: the "anthropic factor" h is irrelevant. Its only pos- 
sible effect is to further suppress the probability distribu- 
tion in a regime whose integrated probability is already 
negligible, namely for t c <C i bs or tA <C tobs- Near the 
half-lines of maximal probability, which dominate the dis- 
tribution, h will at most contribute factors of order unity, 
which we neglect here in any case, and which will not lift 
the degeneracy along the two half-lines, because they can 
only depend on the variable orthogonal to each line. 

The other prefactor, g(logt c ), encodes the prior proba- 
bility distribution over the time of curvature domination. 
We will assume that g decreases mildly, like an inverse 
power of logt c . (If slow-roll inflation is the dominant 
mechanism responsible for the delay of curvature dom- 
ination, logt c corresponds to the number of e-foldings. 
If g decreased more strongly, like an inverse power of t c , 
then inflationary models would be too rare in the land- 
scape to explain the observed flatness.) This will tilt the 
arrows slightly left in regions IV and V, lifting the de- 
generacy along the horizontal half-line. Because of the 
logarithmic dependence, the peak at t c ~ t Q bs is very 
wide, so t c could be large enough for curvature to be un- 
observable in vacua with tA ~ iobs- For example, with 
g ~ (logt c )~ 4 , the probability of detecting curvature in 
a future experiment with sensitivity \Clk\ ~ 10~ 4 is only 
about 10% ngn. 



would render the probability distribution unintegrable, 
unless the effective number of vacua in the landscape, Af, 
is finite. By "effective" , we mean that vacua without 
observers, and vacua that are very suppressed cosmolog- 
ically, are not included in Af. The effective spectrum of A 
is discrete, with spacing of order A/" -1 , and the smallest 
positive value of A will be of this order. Correspond- 
ingly, there will be a largest value of t\, t™ ax ~ Af 1 / 2 . 
This "discretuum cutoff" reduces the half-line of maxi- 
mal probability to the interval 



log t c = log t obs , log t ohs < log t\ < log t 



max 
A 



(9) 



So far, we have treated the timescale of observers, t bs, 
as a fixed input parameter. We will now extend our 
analysis to determine the probability distribution over all 
three parameters, including log t bs- Part of this work has 
already been done, since our calculation of the mass Mcp 
within the causal patch took into account the dependence 
on t bs as well as on tA and t c . The number of observers 
per unit mass, h, also includes a t bs-dependence, though 
we have yet to make any assumptions about it. For the 
joint prior probability, we can write 



a?p 

dlog t A d log t c d log t obs 



= / 



dp 

dlog t A d log t c 



(10) 



where the last factor is given by Eq. Q, and / = 
p(t bs\tA,tc) is the conditional probability that observers 
exist at the time t Q bs in a vacuum with parameters 
(t\,t c ). We thus find a trivariate probability distribu- 
tion simply by multiplying Eq. ([8]) by the function /. 

To analyse this distribution, we will first marginal- 
ize over log t a and log t c to determine the most prob- 
able value of log tobs- Evaluating Eq. ([8| at this value 
then predicts logtA and logt c . We may restrict the in- 
tegral over log t c and log tA to the neighborhood of the 
line interval ([9]), which contains nearly all of the prob- 
ability at any fixed i bs, and in which we argued that 
neither / nor h can depend significantly on log t c or 
logtA- With g — (logt c ) _fc , k > 1, we find from Eq. S 
that the marginal probability distribution over logt b s 
is fhlt~ir, where I is a rational function of logt bs- 
Let us assume that the product fh grows more strongly 
than linearly with t bs, i-e., that fh ~ t ob ^ e ,with e > 
|26j With this assumption, the marginal distribution be- 
comes t obs Z(logt bs). This favors large values for t Q b s : 
log t bs ~ logt™ ax . At this value of logt b s , the interval 
of maximum probability ^ for (logtA, log t c ) shrinks to 
a single point, and we predict 



logt A ~ logtc « log tobs » log {Af 



1/2N 



(11) 



The degeneracy along the vertical half-line t c 



tn 



We thus predict that the number of vacua that con- 
tain observers and can be cosmologically produced is 
Af ~ 10 123 . The relation between Af and the total num- 
ber of vacua in the landscape is not trivial. String theory 
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contains infinitely many stable supersymmetric vacua, 
for example vacua of the form AdS p x S q , with arbitrary 
integer flux. Since we do not live in such a vacuum, it 
must be the case that they either cannot contain any 
observers, or that their integrated cosmological produc- 
tion rate is finite, for example because the production of 
very large flux values is highly suppressed. If the specula- 
tion that ten-to-the- millions of vacua arise from F-theory 
constructions [S7J holds up, then similar considerations 
would have to apply to explain why log AT is so much 
smaller. 
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